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Abstract — In an incoherent dictionary, most signals that admit 
a sparse representation admit a unique sparse representation. 
In other words, there is no way to express the signal without 
using strictly more atoms. This work demonstrates that sparse 
signals typically enjoy a higher privilege: each nonoptimal 
representation of the signal requires far more atoms than the 
sparsest representation — unless it contains many of the same 
atoms as the sparsest representation. One impact of this finding 
is to confer a certain degree of legitimacy on the particular atoms 
that appear in a sparse representation. This result can also be 
viewed as an uncertainty principle for random sparse signals 
over an incoherent dictionary. 

I. Introduction 

The purpose of this paper is to develop a new class of 
uncertainty principles for sparse representation that hold even 
when the sparsity level approaches the ambient dimension. We 
begin with a discussion of the background and related results 
before moving on to the new contributions. 

A. Sparse Representation in Dictionaries 

Let *& be an m x N matrix with normalized columns: 

I|V,II 2 = 1, .7 = 1,2,..., N. 

We refer to $ as a dictionary and to its columns as atoms. 
Assume the atoms span the ambient space C m . 

There are two simple geometric quantities associated with 
a dictionary. The first is a measure of redundancy: 

p=\m\\ 

where || || denotes the spectral, or £2 — > ^2 operator norm, of a 
matrix. We always have p > N/ m. Equality holds if and only 
if $ is a tight frame. The second quantity is the coherence: 



max !(<£,-, 



<Pk) 



The coherence is small when the angle between each pair of 
atoms is large. Strohmer and Heath HI have observed that 



N 



(1) 



m(N - 1) 

In the typical case N > 2m, the inequality (Q3 indicates that 
the coherence cannot be very small: fi > mT 1 / 2 . 

Let S be a subset of {1,2,..., N}, and define $5 to be the 
column submatrix of 3? whose columns are listed in S. We say 



that S is linearly independent if it lists a linearly independent 
family of atoms. Note that <&s is injective if and only if S is 
linearly independent. Suppose that a signal u € C m can be 
written as 

u = &x where supp(x) C S. 

We call the vector x a representation of the signal u, and 
we say that u can be represented with S. When S is linearly 
independent, x is the unique representation of u over S. 

In a redundant dictionary (N > m), each signal has an 
infinity of representations. The sparse representation problem 
asks us to express u with the fewest number of atoms: 



mm \ z 



subject to 



u = &z, 



(2) 



where || || counts the number of nonzero components in its 
argument. If a; is a minimizer of this mathematical program, 
the set S = supp(a;) must be linearly independent. Otherwise, 
we could remove an atom to obtain a sparser representation. 
As a result, when studying sparse representation, we focus on 
linearly independent sets of atoms. 

B. Uniqueness of Sparse Representations 

One might wonder when the problem (f2]i has a unique 
solution. The sparse approximation literature took up this 
inquiry about ten years ago, although one can trace some of 
the ideas to the late 1980s 13. The early research led to the 
following result for deterministic signals. 

Proposition 1: Assume that 



\s\ < z^- 1 



1). 



(3) 



If a signal u = <&x with supp(a;) C S, then x is the unique 
minimizer of (ff]). 

Donoho and Huo established this result for dictionaries con- 
sisting of two orthonormal bases 0; Gribonval and Nielsen 
proved that it holds for every dictionary J4). Subsequently, 
Donoho and Elad showed that Proposition [T] follows from a 
more general result, phrased in terms of the Kruskal rank, 
or spark, of the dictionary 0. Another line of work O, 
|0) sharpened the condition ([3]) for dictionaries consisting of 
multiple orthonormal bases. See J7) for a detailed discussion 
of the spikes and sines dictionary. 

The requirement (O is very stringent: it typically demands 
that the sparsity level \S\ < s/rri. In spite of this apparent 



shortcoming, the condition (0 cannot be improved in general. 
For example, when m is a perfect square, the Dirac comb can 
be represented perfectly using s/m spikes or y/m sines J2) . To 
move past the square-root threshold, we must place additional 
restrictions on the sparse signals we are willing to consider. 

A natural approach is to introduce some randomness. Let S 
be linearly independent, and let x e C s be a random vector 
whose distribution is absolutely continuous with respect to 
the Lebesgue measure on C s . We say that a random signal of 
the form u = $>s x is generic, and we refer to the (unique) 
representation of u over S as the natural representation. 

To obtain interesting uniqueness results for generic signals, 
we impose some additional hypotheses. We say that the 
dictionary $ is a weakly incoherent tight frame if 

||*f = - and M<r^T7, (4) 
to log TV 

where c is an absolute constant. Assume the sparsity level 



In this setting, we have the following result. 

Proposition 2: Assume the dictionary $ satisfies (|4|i and 
the sparsity s satisfies (0. Draw a uniformly random set S of 
s atoms from the dictionary. Except with probability 0(iV _1 ), 
the following statement holds. 

Let u = &s x be a generic signal. With probability one, the 
natural representation of u is the unique minimizer of (0. 

Roughly speaking, Proposition [2] states that a generic sparse 
signal over a random set of atoms is unlikely to have any other 
representation that is equally sparse — even when the sparsity 
level is nearly proportional to the ambient dimension. 

Candes and Romberg established the first theorem of this 
type in the specific case of the spikes and sines dictionary 1181 . 
Using different methods, the present author showed that anal- 
ogous results hold for any strongly incoherent dictionary J9] 
Sec. 7]. The extension to weakly incoherent dictionaries re- 
quires additional ideas from ifTUl Sec. 5]. 

C. Uncertainty Principles 

Historically, the sparse approximation community has 
viewed uniqueness through the lens of uncertainty principles. 
Suppose that a signal has two (different) representations: 

u = = &tV- 

The Donoho-Elad dictionary uncertainty principle Thms. 3 
and 5] stated that 

\S\ + \T\ > fi- 1 . (6) 

In particular, if a signal u can be represented with a set S 
that satisfies ©, then every alternative representation requires 
strictly more atoms. 

Since the coherence usually satisfies fi > to™ 1 / 2 , the dictio- 
nary uncertainty principle only operates in the regime of very 

'Donoho and Elad express their uncertainty principle (5] Thm. 3] in terms 
of the Kruskal rank of the dictionary, which is notoriously difficult to estimate. 
The result quoted here provides the best general bound. 



sparse representations: \S\ < ^fm. Except for very structured 
(or very random) dictionaries, it does not seem possible to 
obtain dictionary uncertainty principles for arbitrary signals 
that hold at sparsity levels near the ambient dimension. 

D. The Sparsity Gap 

This paper describes uncertainty principles for generic sig- 
nals. It is easy to appreciate why generic signals might behave 
better than adversarially chosen signals. If there are small 
sets S and T of atoms for which rangers) and rangc($T) 
intersect, there exists a signal that has sparse representations 
over both S and T. (Witness the Dirac comb!) On the other 
hand, it is hard for a generic signal to have two sparse rep- 
resentations because range($T) rarely contains rangers)! 
This fact offers a plausible route to reach uncertainty principles 
at sparsity levels far greater than ^fm. 

Our first result extends the dictionary uncertainty princi- 
ple (|6]l to generic signals. The proof appears in Section [TVl 

Theorem A (Sparsity Gap under Strong Incoherence): 
Suppose that S is linearly independent, and draw a generic 
signal u in range(^s). Then, almost surely, we cannot 
represent u with a set T disjoint from S unless 

\S\ + \T\ > fx-'VlS]- 

When \S\ = 1, our result coincides with the dictionary 
uncertainty principle (|6}, but it becomes increasingly strict 
requirements as the sparsity level \S\ increases! Indeed, an 
equivalent condition is 

iri> feH- |si ' 

so a generic signal that uses \S\ -C atoms cannot be 
represented with any disjoint set T of atoms unless \T\ 3> |5|. 
In the extreme case where /i~ 2 ~ m, we obtain an uncertainty 
principle that operates at sparsity levels proportional to the 
ambient dimension! 

Our second result is an uncertainty principle that parallels 
Proposition |2 just as Theorem lAl parallels Proposition [T] This 
proof appears in Section [V] 

Theorem B (Sparsity Gap under Weak Incoherence): 
Assume that $ is a weakly incoherent tight frame that 
satisfies ©, and assume further that N > 2m. Suppose that 
S is a randomly chosen set of s atoms, where s satisfies (0. 
Except with probability 0(A r ~ 1 ), the following holds. 

Draw a generic signal u in rangers). Then, almost surely, 
u cannot be represented with a set T disjoint from S unless 




The redundancy p = N/m, by hypothesis. 

In words, Theorem [B] considers a generic signal over a 
random set of atoms. It is likely that every (disjoint) alternative 
representation requires a constant factor more atoms than the 
natural representation, where the extra factor decreases as 
the dictionary becomes more redundant. We see that there is 



typically a sparsity gap between the natural representation and 
the the sparsest representation that uses different atoms. 

This result provides an interesting guarantee for a huge class 
of dictionaries because of the weak bound for the incoherence. 
On the other hand, it holds for a smaller class of signals than 
Theorem [A] because we have randomized the set of atoms in 
addition to choosing generic coefficients. 

II. Rank and File 

Although a generic signal has many representations aside 
from the natural one, there is a large class of representations 
that we can almost surely rule out. As a first step toward our 
main results, we develop an algebraic condition that describes 
which representations can and cannot occur. 

To motivate the discussion, let us recall a standard argument 
for establishing dictionary uncertainty principles. Suppose that 
both S and T are linearly independent. A few moments of 
thought reveals that the following conditions are equivalent: 

1) We have range(* s ) n range(*r) = 0. 

2) The matrix <5> R has full rank, where R = S U T. 

For a fixed set S, suppose that Condition 2) holds whenever 
\R\ < r*. We conclude that, if there exists a signal that has 
representations over both S and T, then |5| + \T\ > r*. Read 
the paper J5) to see this argument in action. 

We can extend this methodology by quantifying the rank of 
the matrix These bounds allow us to count how many 
extra atoms are needed to represent a generic sparse signal. 

Lemma 3: Suppose that both S and T are linearly indepen- 
dent. The following conditions are equivalent. 

1) We have range(#g) n range(^T) £ ran g e (*s)- 

2) We have \T\ < rank(* fl ), where R = S U T. 

Proof: Define the subspaces = range(«l>s) an d & = 
range(3>r), which implies that 5? + 5? — rangc($/j). Note 
that 5? n & is a proper subspace of 5? if and only if 

dim(j^ (15)< dim(j^) (7) 

The algebra of subspaces yields 

dim(j^ n &) = dim(j^) + dim(^) - dim(^ + 

Therefore, the condition (Q is equivalent with 

dim(^) < dim(j^ + £T). 

Since T is linearly independent, dim(^) = \T\. Meanwhile, 

dim(,y + ,3?) = dim(range($ij)) = rankf^^). 

This is the required conclusion. ■ 
Let us translate the previous result from the language of 
subspaces to the language of probability. 

Corollary 4: Suppose that both S and T are linearly inde- 
pendent. The following conditions are equivalent. 

1) A generic signal u = 3>s x almost surely has no 
representation of the form u = &tV- 

2) We have \T\ < rank(* fl ), where R = SUT. 
Proof: Lemma [3] states that Condition 2) is the same as 

rangers) n rangc(*T) £ rangers), (8) 



so we prove that Condition 1) is the same as (H). To that end, 
let u = *f?sx be a generic signal, which means that x is 
absolutely continuous with respect to the Lebesgue measure 
on C . Let v denote the Lebesgue measure on rangers). 

First, assume I© holds. A proper subspace has zero 
Lebesgue measure, so 

f (range(«&s) n rangc(€>T)) = 0. 

The set S is linearly independent, so $5 is injective. As 
a result, the distribution of u is absolutely continuous with 
respect to v. It follows immediately that 

P{ti£ rangers) n rangc(€»T)} = 0. 

We conclude that 

P{ti£ range(*r)} = 

because the signal u G range(^g). 

Conversely, suppose ([8]l is false. Then rangc($s) C 
range($T), so the signal u can be represented over T. ■ 

It is convenient to remove the assumption of linear inde- 
pendence from the previous result. 

Corollary 5: Suppose that S is linearly independent, and 
let T be any other set of atoms. Assume that 

\T\ < rank(* fi ), where R = S U T. 

Draw a generic signal u = &s x - Then 

P{ti£ rangc(* T )} = 0. 

Proof: When T is linearly independent, the claim follows 
directly from Corollary [4] Otherwise, extract a maximal linear 
independent subset T' from T, and write R' = S U T". Apply 
the result to T' to obtain the statement 

\T'\ < rank($ fl <) ¥{ue range(* T ')} = 0. 

Since T' is maximal, rank(4»fl/) = rank($^) and also 
range(^T') = range(^T)- To complete the proof, note that 
the hypothesis \T\ < rank(€>fl) implies \T'\ < rank(€>fl) 
because T' C T. ■ 

III. Analytic Rank Estimates 

The main challenge is that we only possess ana- 
lytic/geometric information about the dictionary, encapsulated 
in the redundancy p and the coherence fi. But the rank is 
fundamentally an algebraic quantity. Our approach will be to 
construct analytic estimates for the rank that we can compute 
from the data at hand. 

A. Schatten Norms 

A primary tool is the Schatten class of matrix norms. Let A 
be a matrix, and write er(A) for the vector of singular values 
of A, arranged in weakly decreasing order. The Schatten p- 
norm is defined as 

\\A\\ Sp = \\a(A)\\ p , 



where ||-|| is the usual l v vector norm. In particular, S2 is 
the Frobenius norm, and Soo is the spectral norm. The norm 
S\ is often called the trace norm because 

J| A\\ s = tracc(A) when A is psd. 

The term psd abbreviates positive semidefinite. For general 
matrices, the Frobenius norm is the only Schatten-class norm 
computable directly from the matrix entries: 

1/2 



|A|| F = 



B. Rank Bounds via Norm Ratios 

A simple but powerful method for estimating rank is to 
compare two different Schatten norms of the same matrix. 
Lemma 6: Suppose that p < q. For each matrix A, 

vq/(q-p) 

rank(A) > 

Proof: For each vector x £ C r , we have the inequality 

< r l/p-l/3 



Indeed, one can use Lagrange multipliers to verify that the 
left-hand side is maximized when a; is a constant vector. 

Suppose that rank(A) = r. Then the vector cr of nonzero 
singular values of A lies in C r . By definition of the Schatten 
norms, 

^ A ^Sp = i^P < r i/p-i/g. 



Take the (1/p— l/q) root and simplify the exponent to reach 
the conclusion. ■ 

The following simple corollary is fantastically useful. 

Corollary 7: Let A be a matrix. Then 



rank(A) > 



I Si 



and rank(A) > 



IAI 



Alon has applied the first estimate in his work on extremal 
combinatorics flTTl . The second estimate arises in a paper of 
Bourgain and Tzafriri on restricted invertibility lfl2l . 

C. A Schur Complement Rank Identity 

Suppose that X is a psd matrix, partitioned so that its 
diagonal blocks are square: 



X = 



A 

B* 



B 

C 



Provided that the block A is nonsingular, the Schur comple- 
ment of A in X is the matrix 

X/A = C - BA l B*. 

For our purposes, the relevant fact is that 

rank(X) = rank(A) + rank(X/A). (9) 

See lfl3l Sec. 2] for more Schur complement identities. 



IV. Sparsity Gap under Strong Incoherence 

Corollary [5] indicates that we can obtain uncertainty prin- 
ciples for generic signals by developing lower bounds on the 
rank of a subdictionary This section describes the simplest 
approach to this problem, which proceeds via Corollary UJ This 
method is most effective when the coherence p is small. 

Let R be a set of atoms. Since 



rank(*^#i?) = rank(* fl ), 



(10) 



we may as well work with the Gram matrix of This 
substitution allows us to exploit geometric information about 
the dictionary. Indeed, the diagonal entries of &* r &r equal 
one because the atoms have unit £2 norm, and the off-diagonal 
entries are bounded in magnitude by p because they contain 
the inner products between distinct atoms. 
Lemma 8: Let R be a set of r atoms. Then 

rank( * fl) ^ 1-Mr-l)^ - 
Proof: Relation ( TTOb and Corollary [7] imply that 



rank(*fl) = rank(**3>fl) > 



I*r*rIU 



ii*«*Hii; 

Owing to the properties of the Gram matrix, 



(ID 



(trace &* r $r) 2 



and 



H^^Hf <r + r(r-l> 2 

Introduce these bounds into (fTTb to complete the proof. ■ 
When the coherence p is small, Lemma[8]provides excellent 

estimates for the rank of &r. By combining this bound with 

Corollary [5] we obtain our first main result. 

Theorem 9: Suppose that S indexes a linearly independent 

set of s atoms and that T lists t atoms. Assume that the size 

S of their intersection 



\Sr\T\ < s 



1 



t- 1 



s l-tp 2 
Let it = &s x be a generic signal in rangers) 
F{ue rangc(* T )} = 0. 



(12) 



Then 



A fortiori, there is zero probability that it can be represented 
using any set T of t atoms whose overlap with S equals S. 
Proof: Define the set R = S U T. Observe that 

r := \R\ = \8\ + \T\ - \S n T\ = s + 1 - 6. 

Corollary \5\ states that t < rank($^) implies 

P{u e rangc(* T )} = 0. (13) 

According to Lemma [8] 

< rank(3>i?). 



1 + (r - l) A i 2 
Therefore, another sufficient condition for ( fT3T > is 

r s + t — 5 



t < 



l + (r-l)/i 2 l + (s + t-5-l)p? 



Solving this relation for 8 results in the bound 



8 < s - 



(t l)tp 2 



l-tp? ' 

If 8 satisfies this condition, then JTJI i is in force. 

Since only a finite number of sets T meet the hypotheses 
of the theorem, there is zero chance that u is representable 
using any such set T of atoms. ■ 

In words, Theorem|9] states that a generic signal constructed 
using a set of s atoms almost surely has no representation 
using another set of t atoms unless there is a substantial 
overlap between the two sets. 

For example, suppose fj, < mT 1 ! 2 , and let it be a generic 
signal in range($s), where \S\ < m/3. Take t — s in 
the overlap condition ( fT2b to see that, almost surely, every 
representation of u with s atoms requires at least s/2 atoms 
from SI 

To obtain an uncertainty principle, note that the relation ( fT2] i 
is quadratic in t. By reverting this inequality, we obtain a 
sufficient condition on t as a function of s and 8. 

Corollary 10: With the notation of Theorem [9] assume 



t<(s-5-l) 




1 



s-S-1 



1 



8-1 



- 1 



Then P{tt e rangc(* T )} = 0. 

After a considerable amount of algebra, Corollary [10] sim- 
plifies to a new uncertainty principle. 

Corollary 11 (Uncertainty Principle for Generic Signals): 
Suppose S is linearly independent. A generic signal u in 
rangc(<&s) almost surely has no representation over a set T 
with overlap 8 — \S T\ unless 



\S\ + \T\> 8 + ^ 1 ^\S\~S. 
Theorem lAl follows from Corollary QT] by setting 8 = 0. 

V. Sparsity Gap under Weak Incoherence 

In this section, we use a different technique to bound 
the rank of 3>b below. This approach relies on the Schur 
complement identity (O and the second part of Corollary[7] We 
also require some information about the properties of random 
sets of atoms, drawn from J5), iflOl . The conclusions are most 
interesting for weakly incoherent dictionaries. 

Assume S is linearly independent, which implies that the 
Gram matrix &* s $>s is invertible. Let V be an index set 
disjoint from S, and write R = S U V. Then the Gram matrix 
has the block structure 



The identity (O shows that we can control the rank of the 
Gram matrix by controlling the rank of the northwest block 
and its Schur complement. To state the result, we recall that 

p s = ^(^^s)- 1 *! 

is the orthogonal projector onto range(«I>s). 



(14) 



Lemma 12: Suppose that S is linearly independent; let V 
be disjoint from S; and define R = S U V. Then 

rank(* H ) = \S\ + rank((I - P s )*v)- 

Proof: The Schur complement identity (O shows that 

rank(#^*i{) = rank^g'&s) + r&nk(<fr* R <frfi/&* s <frs)- 

The set S is linearly independent, so 

rank($J* s ) = rank(* s ) = \S\ . 

Next, recall the definition of the Schur complement: 

Identify the orthogonal projector onto range to see that 

We conclude that 

rank(*|j* H /*5* s ) = rank((I - P s )*y) 

because (I - P s ) 2 = (I - P s ). ■ 
The next result provides a lower bound on the second term 

in Lemma Q~2] Here and elsewhere, the dagger f denotes the 

pseudoinverse of a matrix. 

Lemma 13: Suppose that S is linearly independent, and let 

V be disjoint from S. Then 



rank((I - P 5 )*v) > 

p- 1 \V\ (l - ||*^|| 2 max^ s WsVvWl 
Proof: Corollary [7] states that 



rank((I - P s )$v) > 



\(I-P s )*v\\i 



(15) 



ii(i-p s )#vir 

The spectral norm satisfies the bound 

||(I-P s )*v|| 2 < ll*v|| 2 <P 

because $y is a column submatrix of With a little more 
work, we can produce a lower bound on the Frobenius norm. 



|(I-P s )<Mf=E eV WV- p s)>Pv 



E, 



>\V\ (i- 



\ P Sfv\\ 2 ) 
2 



|$t 



\*S<Pv\ 



l* f I 

I v s\ 



max || &s<Pv I 



vis 



The first inequality follows by expanding the projector via (fl4l i 
and invoking the usual operator norm bound. Introduce the two 
norm estimates into (T5[ to complete the argument. ■ 
Lemma [T3l is interesting because a random set S of atoms 
from a weakly incoherent dictionary usually leads to small 
values for the mysterious quantities in the rank bound. 



Proposition 14: Suppose that S is a uniformly random 
subset of {1, 2, . . . , s} with cardinality s. For /3 > 1, 



max ||*sV«ll 2 > C ^ 



y/logN + 2 




ill < C/3 



H log A 



ps log A 

TV 



< 2N- 



< 2N~ 



The number C is an absolute constant. 

Proposition [14] follows from the results in flOl Sec. 5] 
after some standard arguments. We use Markov's inequality to 
convert the moment bounds into tail bounds. Then we invoke 
a simple decoupling argument (see JT] Lem. 14] for a model) 
to transfer the result for a random set with expected cardinality 
s to a random set with exact cardinality s. 

To take advantage of Proposition [14] we restrict our at- 
tention to a specific class of dictionaries. Assume that 3? is 
a weakly incoherent tight frame, as defined in (0). Suppose 
furthermore that the sparsity s satisfies ©. If we fix a 
sufficiently small value for constant c and take (3 = 1, 
Proposition [14] ensures that 



< V2, 



maxygs ||*sVt)||2 ^ V 2 and 

except with probability 0(N^ 1 ). The upshot of this discussion 
is the following bound. 

Corollary 15: Assume the hypotheses of Lemma [T3l are in 
force. Let R = S U V. Then 

2m\V\ 



rank(* fl ) > \S\ 



N 



We reach our final major result by introducing this bound 
into Corollary [5] 

Theorem 16: Assume that <f> satisfies the weak incoherence 
conditions (O, and assume further that A > 2m. Draw a 
random set S of s atoms, where s satisfies ©. Except with 
probability 0(A _1 ), the following result holds. 

Suppose that T lists t atoms. Let 6 = \S P\ T\ be the size 
of the overlap with S, and assume that 



t < 



25m 



1 - 



2m 
~A 



Draw a generic signal u = 3>s x. Then 

P{tt G rangc(<&T)} = 0. 

A fortiori, there is zero probability that u has a representation 
over any such set T of atoms. 

Proof: Draw a random set S of s atoms. Corollary [T5l 
guarantees that, with probability 0(A~ 1 ), the following result 
holds. For any given set T of t atoms, define V = T\ S and 
select R = S U V. Then 

2m\V\ 



N 



< rank(# fl ). 



(16) 



Now, suppose that T is a specific set of t atoms. According 
to Corollary [5] the condition t < rank(3>R) implies that a 



generic signal in rangers) almost surely has no representa- 
tion using the atoms in T. It follows from ( TToT l that 



t < s 



2m\V\ 
N 



is a stricter sufficient condition that u almost surely cannot be 
represented with T. Introduce the identity |y | = t — S into the 
last relation and rearrange to obtain the equivalent condition 

t < I s — I I 1 



N 



2m 



As usual, we may take a union bound over the finite number 
of admissible T to obtain a uniform result. ■ 
The message may be clearer if we make additional sim- 
plifications to the sufficient condition in Theorem [16] First, 
subtract and add 2sm/N in the first parenthesis to reach 



t < 



1 



2m 
"/V 



2(s — S)m 
TV 



1 



2m 
"/V 



Since 1 < (1 — 2m/N) 1 , we find that a further sufficient 
condition is 

2(s-6)m 

(17) 



t < s + 



N 



In words, there is zero probability that a generic signal in 
range(<I>s) has a representation using t atoms unless t is 
somewhat larger than s or the alternative representation uses 
many atoms from S. Take S = in ( fTTb to reach Theorem iBl 
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